44 research outputs found

    REPARATION : ribosome profiling assisted (re-)annotation of bacterial genomes

    Get PDF
    Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/ REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames

    Ribosome signatures aid bacterial translation initiation site identification

    Get PDF
    Background: While methods for annotation of genes are increasingly reliable, the exact identification of translation initiation sites remains a challenging problem. Since the N-termini of proteins often contain regulatory and targeting information, developing a robust method for start site identification is crucial. Ribosome profiling reads show distinct patterns of read length distributions around translation initiation sites. These patterns are typically lost in standard ribosome profiling analysis pipelines, when reads from footprints are adjusted to determine the specific codon being translated. Results: Utilising these signatures in combination with nucleotide sequence information, we build a model capable of predicting translation initiation sites and demonstrate its high accuracy using N-terminal proteomics. Applying this to prokaryotic translatomes, we re-annotate translation initiation sites and provide evidence of N-terminal truncations and extensions of previously annotated coding sequences. These re-annotations are supported by the presence of structural and sequence-based features next to N-terminal peptide evidence. Finally, our model identifies 61 novel genes previously undiscovered in the Salmonella enterica genome. Conclusions: Signatures within ribosome profiling read length distributions can be used in combination with nucleotide sequence information to provide accurate genome-wide identification of translation initiation sites

    Profiling of Small Ribosomal Subunits Reveals Modes and Regulation of Translation Initiation

    Get PDF
    Translation initiation is often attributed as the rate-determining step of eukaryotic protein synthesis and key to gene expression control. Despite this centrality, the series of steps involved in this process is poorly understood. Here, we capture the transcriptome-wide occupancy of ribosomes across all stages of translation initiation, enabling us to characterize the transcriptome-wide dynamics of ribosome recruitment to mRNAs, scanning across 5′ UTRs and stop codon recognition, in a higher eukaryote. We provide mechanistic evidence for ribosomes attaching to the mRNA by threading the mRNA through the small subunit. Moreover, we identify features that regulate the recruitment and processivity of scanning ribosomes and redefine optimal initiation contexts. Our approach enables deconvoluting translation initiation into separate stages and identifying regulators at each step.publishedVersio

    Effects of control interventions on Clostridium difficile infection in England: an observational study

    Get PDF
    Background: The control of Clostridium difficile infections is an international clinical challenge. The incidence of C difficile in England declined by roughly 80% after 2006, following the implementation of national control policies; we tested two hypotheses to investigate their role in this decline. First, if C difficile infection declines in England were driven by reductions in use of particular antibiotics, then incidence of C difficile infections caused by resistant isolates should decline faster than that caused by susceptible isolates across multiple genotypes. Second, if C difficile infection declines were driven by improvements in hospital infection control, then transmitted (secondary) cases should decline regardless of susceptibility. Methods: Regional (Oxfordshire and Leeds, UK) and national data for the incidence of C difficile infections and antimicrobial prescribing data (1998–2014) were combined with whole genome sequences from 4045 national and international C difficile isolates. Genotype (multilocus sequence type) and fluoroquinolone susceptibility were determined from whole genome sequences. The incidence of C difficile infections caused by fluoroquinolone-resistant and fluoroquinolone-susceptible isolates was estimated with negative-binomial regression, overall and per genotype. Selection and transmission were investigated with phylogenetic analyses. Findings: National fluoroquinolone and cephalosporin prescribing correlated highly with incidence of C difficile infections (cross-correlations >0·88), by contrast with total antibiotic prescribing (cross-correlations 0·2). Interpretation: Restricting fluoroquinolone prescribing appears to explain the decline in incidence of C difficile infections, above other measures, in Oxfordshire and Leeds, England. Antimicrobial stewardship should be a central component of C difficile infection control programmes

    Cdc14 phosphatase promotes segregation of telomeres through repression of RNA polymerase II transcription

    Get PDF
    Kinases and phosphatases regulate messenger RNA synthesis through post-translational modification of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (ref. 1). In yeast, the phosphatase Cdc14 is required for mitotic exit2,3 and for segregation of repetitive regions4. Cdc14 is also a subunit of the silencing complex RENT (refs 5, 6), but no roles in transcriptional repression have been described. Here we report that inactivation of Cdc14 causes silencing defects at the intergenic spacer sequences of ribosomal genes during interphase and at Y′ repeats in subtelomeric regions during mitosis. We show that the role of Cdc14 in silencing is independent of the RENT deacetylase subunit Sir2. Instead, Cdc14 acts directly on RNA polymerase II by targeting CTD phosphorylation at Ser 2 and Ser 5. We also find that the role of Cdc14 as a CTD phosphatase is conserved in humans. Finally, telomere segregation defects in cdc14 mutants4 correlate with the presence of subtelomeric Y′ elements and can be rescued by transcriptional inhibition of RNA polymerase II

    Widespread genomic influences on phenotype in Dravet syndrome, a ‘monogenic’ condition

    Get PDF
    Dravet syndrome is an archetypal rare severe epilepsy, considered “monogenic”, typically caused by loss-of-function SCN1A variants. Despite a recognisable core phenotype, its marked phenotypic heterogeneity is incompletely explained by differences in the causal SCN1A variant or clinical factors. In 34 adults with SCN1A-related Dravet syndrome, we show additional genomic variation beyond SCN1A contributes to phenotype and its diversity, with an excess of rare variants in epilepsy-related genes as a set and examples of blended phenotypes, including one individual with an ultra-rare DEPDC5 variant and focal cortical dysplasia. Polygenic risk scores for intelligence are lower, and for longevity, higher, in Dravet syndrome than in epilepsy controls. The causal, major-effect, SCN1A variant may need to act against a broadly compromised genomic background to generate the full Dravet syndrome phenotype, whilst genomic resilience may help to ameliorate the risk of premature mortality in adult Dravet syndrome survivors

    Biallelic variants in PCDHGC4 cause a novel neurodevelopmental syndrome with progressive microcephaly, seizures, and joint anomalies.

    Get PDF
    PURPOSE: We aimed to define a novel autosomal recessive neurodevelopmental disorder, characterize its clinical features, and identify the underlying genetic cause for this condition. METHODS: We performed a detailed clinical characterization of 19 individuals from nine unrelated, consanguineous families with a neurodevelopmental disorder. We used genome/exome sequencing approaches, linkage and cosegregation analyses to identify disease-causing variants, and we performed three-dimensional molecular in silico analysis to predict causality of variants where applicable. RESULTS: In all affected individuals who presented with a neurodevelopmental syndrome with progressive microcephaly, seizures, and intellectual disability we identified biallelic disease-causing variants in Protocadherin-gamma-C4 (PCDHGC4). Five variants were predicted to induce premature protein truncation leading to a loss of PCDHGC4 function. The three detected missense variants were located in extracellular cadherin (EC) domains EC5 and EC6 of PCDHGC4, and in silico analysis of the affected residues showed that two of these substitutions were predicted to influence the Ca2+-binding affinity, which is essential for multimerization of the protein, whereas the third missense variant directly influenced the cis-dimerization interface of PCDHGC4. CONCLUSION: We show that biallelic variants in PCDHGC4 are causing a novel autosomal recessive neurodevelopmental disorder and link PCDHGC4 as a member of the clustered PCDH family to a Mendelian disorder in humans

    GWAS meta-analysis of intrahepatic cholestasis of pregnancy implicates multiple hepatic genes and regulatory elements

    Get PDF
    Intrahepatic cholestasis of pregnancy (ICP) is a pregnancy-specific liver disorder affecting 0.5–2% of pregnancies. The majority of cases present in the third trimester with pruritus, elevated serum bile acids and abnormal serum liver tests. ICP is associated with an increased risk of adverse outcomes, including spontaneous preterm birth and stillbirth. Whilst rare mutations affecting hepatobiliary transporters contribute to the aetiology of ICP, the role of common genetic variation in ICP has not been systematically characterised to date. Here, we perform genome-wide association studies (GWAS) and meta-analyses for ICP across three studies including 1138 cases and 153,642 controls. Eleven loci achieve genome-wide significance and have been further investigated and fine-mapped using functional genomics approaches. Our results pinpoint common sequence variation in liver-enriched genes and liver-specific cis-regulatory elements as contributing mechanisms to ICP susceptibility
    corecore